The effect of minor allele frequency on the likelihood of obtaining false positives
نویسندگان
چکیده
Determining the most promising single-nucleotide polymorphisms (SNPs) presents a challenge in genome-wide association studies, when hundreds of thousands of association tests are conducted. The power to detect genetic effects is dependent on minor allele frequency (MAF), and genome-wide association studies SNP arrays include SNPs with a wide distribution of MAFs. Therefore, it is critical to understand MAF's effect on the false positive rate.Data from the Framingham Heart Study simulated data (Problem 3, with answers) was used to examine the effects of varying MAFs on the likelihood of false positives. Replication set 1 was used to generate 1 million permutations of case/control status in unrelated individuals. Logistic regression was used to test for the association between each SNP and myocardial infarction using an additive model. We report the number of "significant" tests by MAF at alpha = 10-4, 10-5, and 10-6.Common SNPs exhibited fewer false positives than expected. At alpha = 10-4, SNPs with MAF 25% and 50% resulted in 69.2 [95%CI: 62.8-75.6] and 70.8 [95%CI: 61.3-80.4] false positives, respectively, compared to 100 expected. Rare SNPs exhibited more variability but did not show more false-positive results than expected by chance. However, at alpha = 10-4, MAF = 5% exhibited significantly more false positives (105.5 [95%CI: 81-130.1]) than MAF = 25% and 50%. Similar results were seen at the other alpha values.These results suggest that removal of low MAF SNPs from analysis due to concerns about inflated false-positive results may not be appropriate.
منابع مشابه
Improved Procedure for Screening Expression Libraries for Novel Autoantigens
The standard method for immunoscreening of a cDNA expression library is time-consuming becauseof the production of a large proportion of false positives during the first and second round of screening.This problem is more important when a sensitive chemiluminescence detection system is used. Due tothe high sensitivity of the detection system, there is a need to avoid false posi...
متن کاملA New Statistic to Evaluate Imputation Reliability
BACKGROUND As the amount of data from genome wide association studies grows dramatically, many interesting scientific questions require imputation to combine or expand datasets. However, there are two situations for which imputation has been problematic: (1) polymorphisms with low minor allele frequency (MAF), and (2) datasets where subjects are genotyped on different platforms. Traditional mea...
متن کاملThe False Discovery Rate in Simultaneous Fisher and Adjusted Permutation Hypothesis Testing on Microarray Data
Background and Objectives: In recent years, new technologies have led to produce a large amount of data and in the field of biology, microarray technology has also dramatically developed. Meanwhile, the Fisher test is used to compare the control group with two or more experimental groups and also to detect the differentially expressed genes. In this study, the false discovery rate was investiga...
متن کاملIdentification of the Rare, Four Repeat Allele of IL-4 Intron-3 VNTR Polymorphism in Indian Populations
Background: Cytokines are cell signaling molecules which upon release by cells facilitate the recruitment of immune-modulatory cells towards the sites of inflammation. Genetic variations in cytokine genes are shown to regulate their production and affect the risk of infectious as well as autoimmune diseases. Intron-3 of interleukin-4 gene (IL-4) harbors 70-bp variable number of tandem repeats (...
متن کاملFrequency of two VKORC1 gene variants and its correlation with warfarin maintenance dose
Warfarin is a commonly-prescribed anticoagulant used to treat and prevent thromboembolic events. The requirement for varying doses of warfarin depends on genetic and environmental components. In this study, the frequency of two single-nucleotide polymorphic variants of the vitamin K epoxide reductase complex subunit 1 (VKORC1) gene (1173 C>T (rs9934438) and 3730 G>A (rs7294)) and its correlatio...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 3 شماره
صفحات -
تاریخ انتشار 2009